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ABSTRACT 


When  a  collection  of  processors  C:={ P,,  •  •  •  ,P S)  operates 
in  parallel,  it  is '.desirable  that  at  any  given  Stage  of  the 
computation,  each  P  .'n  should  have  a  tc.sk  of  about  the  same 
size  to  perform,  ani  each  P.  should  require  about  the  same 
amount  of  information  from  the  other  P's  in  order  to  perform 
its  task.  To  the  extent  that  these  conditions  are  violated, 
parallelism  is  impaired,  in  the  cense  that  the  P's  are  not 
all  used  with  equal  efficiency.  In  cellular  computers,  e.g. 
as  they  might  be  used  for  parallel  image  processing,  these 
conditions  are  maintained  by  having  the  P's  all  perform 
similar  computations  on  different  parts  of  the  input  data,  and 
by  allowing  each  P.  to  receive,  information  from  a  fixed  set 
of  the  others  (its1^' neighbors'^)  ,  where  these  sets  are  all  of 
bounded  size.  This  paper  discusses,  on  an  abstract  level,  the 
concept  of  a  reconf igurable  cellular  computer,  in  which  each 
P.  can  receive  information  from  a  set  S  /  of  the  other  P's, 
and  the  S.'s  are  all  of  bounded  size,  bit  they  need  not  remain 
fixed,  thr&ugfytout  a  computation.  Requiring  the  S.'s  to  have 
bounded  size  implies  that  most  P's  cannot  communiiate  directly; 

.  the  expected  time  required  for  two  arbitrary  P's  to  communicate 
k  depends  on  the  graph  structure  defined  by  the  sets  S..  The 
question  of  how  to  change  the  S.'s  in  parallel  during  the 
course  of  a  computation  is  also1discussed. 


1 .  Parallelism  and  cellular  computers 


Let  C={P^, *  *  * , Pn)  be  a  collection  of  processors  operating 
in  parallel.  In  general,  we  can  regard  each  P^  as  performing 
a  sequence  of  computational  tasks,  and  at  the  end  of  each  task, 
providing  new  information  t.o  other  P's  and  requesting  new 
information  from  other  P's.  In  order  to  make  efficient  parallel 
use  of  the  P's,  we  would  like  all  of  them  to  be  active  as  much 
of  the  time  as  possible.  This  suggests  that  we  should  try  to 
make  the  tasks  as  equal  as  possible  in  size,  to  avoid  the  need 
for  some  P^  having  to  waif,  a  long  time  for  a  piece  of  information 
that  some  other  P^  is  still  computing.  Moreover,  we  should 
design  the  tasks  so  that  each  needs  to  give  about  the  same 
amount  of  information  t.o  other  P's,  and  to  receive  about  the 
same  amount  from  other  P's,  between  tasks,  to  avoid  long  I/O 
delays  while  some  P^  is  sending  or  receiving  information. 

Cc'llular  computers  [1-5]  make  efficient  parallel  use  of 
large  numbers  of  processors  by  dividing  up  both  the  computa¬ 
tional  taks  and  the  I/O  requirements  very  equally  among  two 
P's.  On  an  abstract  level,  in  a  cellular  automaton  [6  -10], 
each  computational  otep  is  symbolized  by  a  change  in  a  processor's 
state,  and  the  new  state  depends  on  the  old  states  of  the  pro¬ 
cessor  and  a  fixed  set  of  its  neighbors;  this  corresponds  to 
a  task  (lookup  of  the  new  state)  that  requires  a  fixed  amount 
of  new  data  (the  neighbor's  states)  to  be  input  (and  by  the 
same  token,  a  fixed  amount  of  data  to  be  output:  one's  own 
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state  to  one's  neighbors),  and  a  fixed  amount  of  computation. 

The  same  principles  are  used  in  concrete  realizations  of  cellu¬ 
lar  computers,  as  applied  to  such  tasks  as  image  processing 
[  1  -  5  ] .  One  can  process  an  image  using  a  square  array  of 
P's,  each  of  which  receives  a  block  of  image  data,  with  neigh¬ 
boring  P's  receiving  neighboring  blocks.  The  processing  is 
performed  in  stages,  and  at  the  end  of  each  stage,  neighboring 
P's  exchange  updated  information  about  their  blocks  for  use 
at  the  next  stage.  Thus  here  again,  every  P^  does  essentially 
the  same  amount  of  processing  and  of  I/O  from/to  neighboring 
P's,  except  that  the  amounts  are  somewhat  less  at  the  borders 
of  the  array.  More  general  examples  could  be  given  in  which 
the  P's  are  connected  to  form  a  fixed  graph  structure  (rather 
than  an  array  structure) ,  and  are  used  to  simulate  interactions 
among  the  nodes  of  the  graph  (see  [10])  ;  note  that  here,  too, 
we  would  want  each  node  to  have  about  the  same  number  of  neigh¬ 
bors,  to  keep  their  I/O  requirements  comparable. 

It  has  usually  been  assumed,  in  studying  cellular  computers, 
that  the  number  of  "neighboring"  P's  with  which  a  given  P^  can 
communicate  directly  is  bounded  — i.e.,  if  we  represent  the  P's 
by  the  nodes  of  a  graph,  and  join  neighboring  P's  by  arcs,  the 
resulting  graph  has  bounded  degree,  which  does  not  grow  with  the 
number  of  P's.  This  assumption  is  very  reasonable  if  we  regard 
neighboring  P's  as  hardwired  together;  the  number  of  I/C  ports 
available  to  a  given  P^  will  be  limited,  no  matter  how  many  P's 
there  are.  But  even  if  we  do  not  assume  hardwired  connections. 


f  l  v.  t-5  i.4  TSSfirill  '  fit  tf-Vi  ' 1  --i>'  f . 


<  -vi*  *;.n 


it  is  still  reasonable  to  require  the  number  of  neighbors 
of  each  P^  to  be  bounded,  in  order  to  put  a  bound  on  the 
amount  of  I/O  that  each  P^  can  do  at  a  given  stage  of  the 
computation.  If  we  do  not  impose  such  a  bound,  different 
P's  may  require  very  different  I/O  delays,  since  some  of  them 
may  need  to  output  or  receive  much  more  information  than  others, 
so  that  once  again  there  is  danger  of  serious  loss  of  paral¬ 
lelism. 

In  conventional  cellular  computers  this  graph  structure 
defining  the  "neighbor"  relations  between  P's  is  not  only 
of  bounded  degree,  but  is  also  assumed  to  remain  fixed  in  the 
course  of  a  computation;  this  allows  us  to  regard  the  neigh¬ 
boring  P's  as  hardwired  together.  In  this  paper,  we  consider 
the  possibility  of  reconf igurable  cellular  computers  in  which 
the  set  of  neighbors  of  each  P^  can  change  during  the  computa¬ 
tion,  but  their  number  remains  bounded.  We  do  not  consider 
here  how  direct  communication  is  physically  realized;  we  simply 
assume  that  each  P^  has  a  list  of  "addresses"  of  those  P^'s  with 
which  it  can  currently  communicate  directly,  and  that  this  list 
always  remains  of  fixed  size.  (For  the  sake  of  concreteness 
[  11  ] ,  we  can  imagine  that  P^  communicates  with  P^  by  putting 
a  message  addressed  to  P^  on  a  very  fast  bus.)  We  also  assume 
that  all  communication  is  potentially  two-way,  i.e.,  if  P^ 
can  address  P^,  then  P^  can  address  P^,  and  conversely. 

When  we  assume,  in  a  cellular  computer,  that  the  nodes  are 
of  bounded  degree,  we  are  making  it  easier  to  achieve  efficient 
parallelism,  but  we  are  also  introducing  a  potential  speed 


limitation  due  to  the  time  that  may  now  be  required  for 
information  to  be  exchanged  between  two  arbitrary  P's* 

A  given  P^  can  communicate  directly  only  with  a  bounded 
subset  of  the  P's,  namely  its  neighbors,  and  if  it  needs 
to  communicate  with  an  arbitrary  P^,  the  message  may  have 
to  be  relayed  through  many  stages.  The  expected  and  worst- 
case  communication  times  between  a  pair  of  P's  depend  on  the 
structure  of  the  graph  that  defines  the  neighbor  relationship; 
examples,  for  various  standard  graph  structures,  are  given  in 
Sec  cion  2.  Evidently,  cellular  computers  are  best  suited  for 
tasks  in  which  each  P^  needs  to  communicate,  for  the  most 
part,  only  with  a  bounded  number  of  others,  and  their  graph 
structures  should  be  designed  so  that,  to  the  extent  possible, 
these  others  are  P^'s  neighbors. 

In  the  case  of  a  reconf igurable  cellular  computer,  another 
problem  arises  when  we  want  to  change  its  graph  structure 
during  a  computation.  If  and  Pj  can  currently  address  one 
another,  it  is  easy  for  them  to  drop  one  another  from  their 
address  lists  by  mutual  agreement.  But  if  and  P^  cannot 
currently  address  one  another,  how  do  they  simultaneously 
add  each  other  to  their  lists?  Section  3  proposes  a  "local" 
approach  to  this  problem,  in  which  P.  and  P.  can  add  each 
other  to  their  lists  only  if  they  currently  have  a  common 
neighbor  P ^ ,  which  they  may  then  simultaneously  drop;  and 
it  is  shown  how,  by  iterating  this  "local  reconfiguration" 
step,  direct  addressing  can  be  established  between  any  two 


desired  P's.  In  Section  4  we  illustrate  this  approach 
by  showing  how  various  standard  graph  structures  can  be 
reconfigured,  in  parallel,  into  other  standard  structures, 
while  maintaining  boundedness  of  degree  throughout. 
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2 .  Communication  time  in  cellular  computers 

Let  G  be  any  undirected  graph,  with  set  of  nodes  NG 
and  set  of  arcs  AG.  Two  nodes  P,Q  are  called  neighbors 
if  (P,Q)<rA G.  By  a  path  of  length  m  between  two  nodes 
P , Q  we  mean  a  sequence  of  nodes  P=Qq , • ,Qm=Q  such  that 
is  a  neighbor  of  Q^_^,  l*i*m.  We  say  that  G  is  connected 
if  there  is  a  path  between  any  two  nodes  of  G.  We  will 
usually  assume  in  what  follows  that  G  is  connected. 

By  the  distance  6(P,Q)  between  P  and  Q  we  mean  the 
shortest  length  of  any  path  between  them.  [It  is  easily 
seen  that  distance  is  a  metric,  i.e.  reflexive  (6(P,Q)= 

0  iff  P=Q) ,  symmetric  [6 (P ,Q) =6 (Q, P)  for  all  P,Q) ,  and 
satisfies  the  triangle  inequality  (6  (P,R)  *6  (P,Q)+6  (Q,R) 
for  all  P,Q,R,).l  The  greatest  distance  between  any  two 
nodes  of  G  is  called  the  diameter  of  G,  denoted  A(G) , 
and  the  expected  distance  between  two  randomly  chosen  nodes 
of  G  is  called  the  expected  diameter  of  G,  denoted  E (G) . 

Let  C  be  a  cellular  computer  with  set  of  processors 
{p^,***,Pn},  and  let  A  be  the  set  of  pairs  of  processors 
that  (currently)  can  directly  communicate  with  each  other. 

If  we  let  Ng={  Pi, • • • ,Pn)  and  AG=A,  we  obtain  an  undirected 
graph  G,  called  the  graph  of  C.  The  degree  of  a  node  P  is 
the  number  of  its  neighbors,  d(P) =| {Q| (P,Q)tAG) j .  We  say 
that  G  has  degree  d  if  d(P)*d  for  all  PfcNG,  where  d  is  as 


small  as  possible.  We  assume  from  now  on  that  the  graph 
of  C  always  has  degreesd  for  some  fixed  d. 

The  expected  amount  of  time  required  for  a  message  to 
get  from  one  randomly  chosen  node  to  another  is  proportional 
to  E (G) ,  and  the  longest  possible  time  for  a  message  to  get 
from  one  node  to  another  is  proportional  to  A(G).  For  a 
given  number  n  of  nodes,  the  values  of  E(G)  and  A(G)  depend 
on  the  graph  structure  of  G.  Table  1  shows  these  values  for 
a  set  of  basic  types  of  graphs.  The  derivations  of  the  E(G) 
values  are  given  in  Appendix  A. 

Table  1  suggests  that  we  can  keep  the  expected  or  max¬ 
imum  communication  time  short  by  using  high-dimensional 
trees  or  arrays  as  graph  structures.  However,  such  struc¬ 
tures  involve  high  node  degrees,  and  the  higher  the  degrees 
are,  the  more  room  there  is  for  differences  between  the 
I/O  requirements  of  different,  nodes.  We  will  therefore 
consider  only  the  low-degree  cases  from  now  on:  string 
and  cycle  (degree*2) ,  binary  tree  (degree*3) ,  and  two- 
dimensional  array  (degree*4)  . 
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Maximum  Expected 


Graph  type 

degree (d) 

Diameter (A) 

diameter (E) 

String 

2 

n-1 

(n+l)/3 

Cycle 

2 

[n/2] 

(n+1) / 4 

Balanced 
binary  tree 

3 

2 ( £-1) 

where  £=log  (n+1) 

2 

U-3)n+24] 

Two-dimensional 

array 

4 

2/rT 

2/rT /3 

Balanced 
k-ary  tree 

k+1 

21 

where  n= (k^+^-l)/ (k- 

1) 

k-dimensional 

array 

2k 

k  Vr 

... 

Table  1.  Values  of  diameter  and 
expected  diameter  for  some  simple 
types  of  graphs,  all  having  n  nodes 


3 .  Reconfiguration  of  cellular  computers 

Suppose  that  and  can  currently  address  one  another, 

and  P^  wants  to  drop  P^  from  ics  address  list.  Then  Pj  must 

drop  P^  from  its  list  simultaneously.  To  insure  this,  P^ 

sends  P_.  a  message  requesting  that  they  drop  each  other; 

acknowledges  and  agrees  to  the  message;  and  they  then  drop 

each  other.  We  assume  here  that  such  messages  are  sent  and 

received  in  a  unit  time  period,  so  that  the  dropping  can 

take  place  simultaneously.  Note  that  when  two  nodes  drop 

each  other,  the  graph  may  become  disconnected;  we  will 

assume  that  normally  this  does  not  happen.  (If  desired,  one 

can  check  that  deletion  of  an  arc  will  not  disconnect  the 

graph  before  actually  deleting  it;  see  [10  ].) 

It  is  less  obvious  how  P^  and  P^  can  add  each  other  to 

their  lists,  if  they  cannot  currently  address  one  another. 

Suppose  first  that  and  P^  have  a  common  neighbor  P^.  The 

sequence  of  events  is  then  as  follows:  P^  (say)  informs 

P.  that  it  wants  to  add  P . ;  P,  asks  P ,  to  add  P . ;  P .  informs 
k  3  k  j  i  3 

P  that  it  agrees;  P,  signals  P..  to  add  P.  and  P_.  to  add  P. 

K  K  i  3  j  i 

simultaneously.  Here  again,  standard  unit  times  are  assumed, 
to  insure  simultaneity.  We  have  also  assumed  that  P^  and 
P_.  both  have  room  to  add  each  other  without  exceeding  the 

J 

degree  bound.  If  this  is  not  so,  we  can  modify  the  construc¬ 
tion  to  make  P .  and  P .  drop  P,  at  the  same  time  they  add  each 
l  3  x 
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other;  this  insures  that  their  degrees  remain  within  the 
bound.  Of  course,  this  assumes  that  there  is  no  objection 
to  disconnecting  from  and  P^. 

In  the  case  of  an  arbitrary  P^  and  P^,  we  proceed  by 
induction  on  the  distance  between  them.  (We  assume  the 
graph  is  connected,  so  that  this  distance  always  exists.) 

If  the  distance  is  1,  they  are  already  neighbors;  if  it  is 
2,  they  have  a  common  neighLor,  and  the  construction  in 
the  previous  paragraph  can  be  used.  Let  the  distance 
between  them  be  m>2,  and  let  P.=Q0,Q, , * • * ,Q  =P .  be  a 
shortest  path  between  them.  Then  p^=Qq  and  Q2  have  the 
common  neighoor  By  the  previous  paragraph,  P^  and  Q2 

can  add  each  other  to  their  lists  and  (if  desired)  can 
drop  from  their  lists.  We  now  have  a  path  P^=QQ ,Q2 f ’ ' * f 

Q  =P.  of  length  m-1,  so  that  the  distance  from  P.  to  P . 

m  j  i  j 

is  now  m-1.  Repeating  this  construction,  we  can  eventually 

r  .ce  the  distance  to  2  and  then  to  at  which  point  P^ 

am  Pj  have  become  neighbors.  As  before,  we  assume  that 

there  is  no  obstacle  to  adding  and  deleting  the  intermediate 

arcs  involved  i..  this  construction. 

The  construction  just  given  shows  only  how  to  create 
an  arc  between  two  arbitrary  given  nodes,  and  assumes  that 
we  are  free  to  create  and  destroy  intermediate  arcs  as 
needed.  During  the  operation  of  a  reconf igurable  cellular 


computer,  many  pairs  of  nodes  may  want  to  connect  or  dis¬ 
connect  themselves  at  the  same  time,  and  it  will  not  in 
general  be  possible  to  carry  out  the  necessary  reconfigur¬ 
ation  steps  simultaneously  without  conflict.  To  demonstrate 
that  the  concept  of  reconfiguration  is  useful,  we  must  show 
how  graph  structures  can  be  nontrivially  reconfigured  in 
parallel.  In  Section  4  we  will  sketch  several  such  recon¬ 
figuration  algorithms  which  allow  conflict-free  parallel 
transformations  between  strings,  cycles,  arrays,  and  trees. 
We  will  generally  assume  in  these  algorithms,  as  is  commonly 
assumed  for  graph-structured  cellular  automata,  that  the 
graph  has  a  distinguished  node. 


4 .  Some  parallel  reconfiguration  algorithms 

4 . 1  Strings  and  cycles 

It  is  trivial  for  a  cycle  to  transform  itself  into  a 
string  by  dropping  an  arc,  e.g.  one  of  the  arcs  incident 
on  the  distinguished  node.  Conversely,  for  a  string  to 
transform  itself  into  a  cycle,  the  node  at  one  end  (which 
we  may  assume  to  be  distinguished)  succesively  connects 
itself  to  the  third,  f ourth, ...  nodes,  using  the  path¬ 
shortening  construction  in  Section  3,  until  it  is  connected 
to  the  other  end;  each  intermediate  arc  used  in  this  con¬ 
struction  is  deleted  as  soon  as  the  next  arc  is  formed.  The 
time  required  to  form  a  string  into  a  cycle  is  proportional 
to  the  length  of  the  string. 

4 . 2  String  to  tree 

For  a  stringf  say  of  length  l,  to  transform  itself  into 
a  balanced  binary  tree,  a  construction  similar  to  that  used 
for  firing  squad  synchronization  can  be  employed.  The  mid¬ 
point  Mq  of  the  string  (or  one  of  the  two  midpoints,  if  l  is 
even)  is  identified  by  sending  two  signals  from  the  (distin¬ 
guished)  end  node,  one  at  unit  speed  and  one  at  1/3  speed; 
the  unit  speed  signal  bounces  back  from  the  other  end  and 
meets  the  1/3  signal  at  Mq.  Next,  we  similarly  find  mid¬ 
points  M^  and  of  the  two  halves  of  the  string,  and  at 


the  same  time,  we  connect  Mg  to  each  of  them;  Mg  is  the 

root  of  the  tree  being  constructed,  and  M  ,*1^  are  ^ts  sons' 

We  now  have  two  substrings  with  midpoints  M^,M2  an<^  we 

repeat  this  process  in  parallel  for  each  of  them,  thus 

joining  to  the  midpoints  of  ^ts  halves»  and 

M2to  the  midpoints  M21,M22  of  halves.  After  log2£ 

repetitions  of  this  procedure,  we  have  constructed  the 

tree.  The  total  time  required  for  the  construction  is 
3  3  £  3  £  3 

about  '2£+2’“2+2"  j2+*  •  •+-2‘ <3£,  proportional  to  £. 


4 . 3  Tree  to  string 

Given  a  binary  tree,  we  can  reconfigure  it  into  a 
string  in  time  proportional  to  the  height  of  the  tree. 

We  do  this  by  repeatedly,  in  parallel,  joining  each  node 
to  the  right  son  of  its  left  son  and  the  left  son  of  its 
right  son,  and  disconnecting  it  from  its  left  and  right 
sons,  where  "left"  and  "right"  refer  to  an  arbitrary 
given  labeling  of  the  sons  of  each  node.  Figure  1  illus¬ 
trates  how  this  process  works  in  a  simple  example.  It  is 
not  hard  to  see  that  when  the  process  terminates,  each 
node  is  joined  to  the  rightmost  descendant  of  its  left 
son  and  the  leftmost  descendant  of  its  right  son,  and 
the  resulting  arcs  define  a  string  which  corresponds  to 
an  inorder  traversal  of  the  tree. 

4 . 4  String  to  array 

We  assume  that  each  node  knows  the  length  £  of  the 
string  anc5  its  own  position  in  the  string.  Let  s  =  \/Z  1, 


and  regard  the  string  as  composed  of  substrings  of  length 
s.  We  join  the  ith  node  of  each  substring  to  the  ith  node 
of  the  following  substring,  1  &  i  *  s,  and  disconnect  the 
last  node  of  each  substring  from  the  first  node  of  the 
following  one.  Evidently,  all  these  joinings  can  take 
place  in  parallel.  [We  do  this  as  follows:  Assume  that 
nodes  nos.  1,  l+2s....  are  specially  marked.  Each  marked 
node  ks+i  joins  itself  to  the  next  marked  node,  using  the 
stepwise  construction  of  Section  3.  As  soon  as  this  pro¬ 
cess  has  passed  node  ks+i,  it  too  starts  a  reconstruction 
process,  which  stops  as  soon  as  it  finds  a  node  that  still 
has  only  two  neighbors  and  occurs  after  a  marked  node; 
this  can  only  be  node  (k+l)s+i-]  The  substrings  csin  be 
regarded  as  the  rows  of  an  array,  and  the  new  arcs  connect 
the  successive  nodes  in  each  column.  If  Z  is  not  a  perfect 
square,  the  last  row  will  be  shorter  than  the  others.  Evi¬ 
dently,  the  joining  process  takes  time  proportional  to  /Z. 

4 . 5  Array  to  tree  or  string 

To  change  an  array  into  a  string,  we  can  simply  build  a 
breadth-first  spanning  tree  of  the  array  with  one  of  the 
corner  nodes  as  root;  readily,  this  tree  is  binary  and  can 
be  constructed  in  such  a  way  that  it  has  height  equal  to 
the  array's  city  block  diameter.  The  construction  of  Sec¬ 
tion  4.3  can  then  be  used  to  convert  the  tree  into  a  string. 
The  process  takes  time  proportional  to  the  array  diameter. 
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For  the  details  of  a  tree  construction  process  that  yields 
a  tree  of  the  desired  height  that  is  as  balanced  as  possi¬ 
ble,  see  Appendix  B. 

4 . 6  Tree  to  array 

A  binary  tree  can  be  converted  into  an  array  by  first 
changing  it  into  a  string  (Section  4.3)  and  then  changing 
the  string  into  an  array  (Section  4.4);  but  the  latter  pro¬ 
cess  takes  o( string  length)  time.  A  more  complicated 
construction  can  be  given  which  requires  only  0 (array  dia¬ 
meter)  time;  for  the  details,  see  Appendix  B.  It  would 
be  of  interest  to  design  an  algorithm  tnat  requires  only 
0(t.ree  height)  time. 


5.  Concluding  remarks 


This  paper  has  suggested  that  it  may  be  of  interest 
to  study  reconf igurable  cellular  computers,  in  which  the 
number  of  processors  that  can  address  a  given  one  is  bound¬ 
ed,  but  the  set  of  these  processors  can  change,  thus  modi¬ 
fying  the  graph  structure  defined  by  the  addressability 
relation.  Examples  were  given  illustrating  how  various 
simple  graph  structures  can  be  reconfigured  into  one 
another  in  parallel. 

Ordinary  cellular  computers  are  applicable  to  compu¬ 
tational  tasks  which  can  be  divided  among  the  processors 
in  such  a  way  that  only  certain  pairs  of  processors  need 
to  interact;  one  would  then  define  the  graph  structure  of 
the  computer  so  as  to  make  these  pairs  neighbors.  More 
generally,  one  could  imagine  a  computational  task  in  which, 
at  various  stages,  different  kinds  of  interprocessor  inter¬ 
actions  are  needed.  Such  a  task  could  be  handled  by  a 
reconf igurable  cellular  computer  which  changed  its  graph 
structure  at  the  end  of  each  stage. 

As  an  example  of  such  a  multistage  task,  let  us  again 
consider  the  domain  of  image  processing.  We  know  that  an 
array-structured  cellular  computer  is  useful  at  an  early 
stage  of  image  analysis,  when  local  operations  are  being 
performed  on  the  image.  The  result  of  this  stage  might  be 
a  segmentation  of  the  image  into  regions,  and  we  might  then 
want  to  perform  further  processing  at  the  region  level,  e.g. 


merging  regions,  or  identifying  particular  configurations 
of  regions  by  matching  against  models.  This  level  of  pro- 
cessing  might  be  best  carried  out  on  a  cellular  computer 
configured  in  such  a  way  that  each  node  represents  a  region, 
and  neighboring  nodes  represent  adjacent  regions.  It  is 
not  difficult  to  define  reconfiguration  algorithms  which, 
given  an  array-structured  cellular  processor  in  which  region 
labels  have  been  attached  to  the  nodes,  can  construct  a 
graph-structed  cellular  processor  representing  the  adjacency 
graph  of  the  regions.  A  paper  describing  such  algorithms 
is  in  preparation  [12-14]. 
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Appendix  A:  Expected  Diameters 

1)  Cycle 

In  a  cycle  of  odd  length  n,  the  sum  of  the  distances 

from  any  given  node  to  the  other  nodes  is 

(n-l)/2  .  , 

21  i  =  . 

i=l 

Hence  the  average  distance  from  an  arbitrary  node  to  any 
other  node  is  the  sum  divided  by  n-1,  or  (n+1) /4 .  If  n  is 
even,  the  sum  is 

n  (n-2)/2 

§  +  2  l  i=  r2 
^  i=l  & 

so  that  the  average  is  n2/4 (n-1) .  Note  that  if  we  include 
the  given  node  itself  (distanced)  in  the  average,  the  deno¬ 
minator  is  r.  rather  than  n-1,  so  that  we  obtain  n/4  in  the 
even  case,  and  (n2-l)/4n  in  the  odd  case. 

2)  Array 

In  an  r*s  rectangle,  the  sum  of  the  distances  from  any 
of  the  corner  nodes  to  the  other  nodes  is 

r1  r\i+j)  =  iSjjLZll  +  «<;.-!>  .  rs  (r+s-2)/2 

i=0  j=0  ^  z 

Hence  in  a  U*V  rectangle,  we  can  find  the  sum  of  the  distances 
from  a  given  node  (u,v)  to  the  other  nodes  by  regarding  (u,v) 
and  three  of  its  neighbors  as  the  corners  of  four  subrectangles: 


t  r*K-> 
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R  .  .  R 

1  2 

.  .  .  (u,  v) (u+1, v)  .  .  . 

.  .  . (u, v-1) (u+1 , v-1) .  .  . 

R  R 

3  4 

•  • 

•  V 

Now  a  node  in  R  or  R  has  distance  from  (u,v)  1  greater  than 

2  3 

its  distance  from  its  own  corner,  while  a  node  in  R  has  dis- 

4 

tance  2  greater.  Hence  the  sum  of  the  distances  from  (u,v) 

is  the  sum  of  the  distances  (+  1  or  2)  from  the  nodes  of  R  , 

i 

R  ,  R  ,  R  to  their  respective  corners.  Since  the  sizes  of 

2  3  4 

R  ,  R  ,  R  ,  R  are  u*v,  (U-u)xv,  ux (V-v) ,  and  (U-u) x (V-v) , 

12  3  4 

respectively,  the  sum  is 

uv  (u+v-2)  ,  (U-u)v[lUU“Bl+V-23 

2  1 

+  u(V-v)  [l+^y-v)-23  +  (u_u)  (V_v)  [2-ti^~u)^(V~v)~2] 

which  evaluates  to 

Uv2+Vu2-U  (V+l)  v-V  (U+1)  u  +  ^-UV  (U+V+2 ) 

and  the  average  distance  is  this  divided  by  UV-1.  To  obtain 

the  average  distance  between  a  pair  of  arbitrary  (distinct) 

nodes,  we  must  average  this  result  over  (u,v) ,  i.e.  by  taking 
u  v 

JL  E  E  of  it.  Now  applying  this  to  v2  yields  (V+l)  (2V+D/6; 
UV  u=i  V=1 

to  u2,  (U+1)  (2U+l)/6;  to  v,  (V+l)/2;  and  to  u,  (U+l)/2.  Hence 
our  final  average  is 

g^;[U(V+l)  (2V+l)/6  +  V  (U+1 )  (2U+l)/6 

-  U (V+l) 2/2  -  V(U+1)2  +  UV (U+V+2 ) /2 ] 
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which  evaluates  to  (U+V) /3 .  In  particular,  for  a  square 


array  of  n  nodes  we  have  U=V=y^~,  so  that  the  average  is 
2/n/3;  and  for  a  string  of  n  nodes  we  have  U=n,  V=l,  so 
that  the  average  is  (n+1) /3 . 

3)  Tree 

A  complete  binary  tree  T  of  height  h  has  l,2,...,2h_1 
nodes  at  levels  0,1,..., h-1,  respectively.  For  a  node  N  at 
level  k,  the  sum  of  the  distances  to  the  other  nodes  can  be 
computed  as  follows:  Let  N  be  at  distance  d  from  the  root 
of  a  subtree  T'  of  height  r  which  does  not  contain  N.  Thus 
N  is  at  distance  d+1  from  2  nodes  of  T',  d+2  from  4  nodes, 

...,  and  d+r-1  from  2  nodes.  The  sum  of  the  distances 
from  N  to  the  nodes  of  T'  is  thus 

d  +  2 (d+1)  +  2 2 (d+2)  +  ...  +  2r~1  (d+r-1) 
r-1 

=  d  (2-1)  +  Z  i2x  =  d ( 2r— 1 )  +  (r-2) 2r+2  =  (d+r-2)2r  -  (d-2)  . 
i=l 

Let  N  be  at  level  k;  then  we  can  decompose  T  into  subtrees 


as  follows: 

Distance  from 

y* 

Sum  of  distances 
from  N  to  nodes 

Hoot  of  T' 

root  of  T'  to  N 

Height  of  T' 

of  T' 

N's  brother 

2 

h-k 

(h-k)2h'k 

N's  father's 
brother 

3 

h-k+1 

(h-k+2 )2h~k+1-l 

N's  grand¬ 
father's  bro¬ 
ther 

• 

4 

• 

h-k+2 

• 

(h-k+4)2h~k+2-2 

• 

• 

• 

The  brothe.;  of 
N's  ; ■  ^estor 
just  oelow  the 
root  of  T 

• 

• 

k+1 

• 

• 

h-1 

• 

• 

(h+k-2)2h'1  -  (k-1) 

In  fact,  T  consists  of  these  subtrees  together  with  N's 
father,  grandfather,  . ..,  and  the  root  of  T,  which  have  dis¬ 
tances  1,2,. . . ,k  from  ,  hence  sum  of  distances  k(k+l)/2; 
and  the  subtree  rooted  at  N  itself,  which  has  sum  of  distan¬ 
ces  (h-k-2)2^  k  +  2  from  N.  The  contribution  to  the  sum 
from  the  subtrees  in  the  table  is 

k  ,  .  k-1  -  .  k-1 

(h-k)  £2n_1  +  Z  (2i)  2n”*'t'1  -  Zi 
i-1  i=l  i=l 

=  (h-k)2h_k(2k-l)  +  2h_k+1((k-2)2k+2)  -  k(k-l)/2 


The  total  sum  of  distances  for  a  node  N  at  level  k  is  thus 

(h-k-2)  2h""k  +  2  +  (h-k)  2h-k  (2k-l)  +  2h_k+1  (  (k-2 )  2k+2 )  +  k 
=  (k-2)2h+1  +  (h-k) 2h  +  2h~k+1  +  k  +  2 
=  2h(h+k-4)+2h“k+1  +  k  +  2 

and  the  average  distance  is  this  divided  by  2  -2  (nodes  ^N) . 

To  get  the  average  distance  between  two  arbitrary  nodes,  we 
take  a  weighted  average  of  these  sums,  with  weights  2k  (rep¬ 
resenting  the  2k  nodes  at  level  k,  k=0 , 1 , . . . ,h-l) ,  and  denomi¬ 
nator  2h-l  (the  total  number  of  nodes  in  T) .  This  yields 


1 _ r  ,-h 

r-  u  [  x  2 

(2-1)  (2n-2) 

1  r  ,„h 

H - h - 1  (2 

(2-1) (2-2) 


(h-4 ) +2 ) 

(h-4) -t-2) 


k=0 


(2h-l) 


h  h-1  h-1  ,  ,  . 

+  (2n+l) Z  k2k  +  Z  2h+1] 
k*=0  k=0 

+  (2h+l) ( (h-2)2h+2)  +  h2h+1 


For  a  tree  having  n  nodes,  we  have  n=2^-l,  so  that  this  may  be 
written  as 

j$zrf[(h-3)<n+l)  +  (h+3)]  -  |^±U-t(h-3)n  +  2hJ 
where  h  =  log2 (n+1) . 


Appendix  B:  Array/Tree  and  Tree/Array  Reconfiguration* 


In  Section  4  we  outlined  a  number  of  algorithms  for 
parallel  reconfiguration  of  one  graph  into  another;  but 
some  of  these  algorithms  were  not  the  fastest  possible 
or  did  not  yield  the  best  possible  results  (see  Sections 
4.5  and  4.6).  The  purpose  of  this  appendix  is  to  show  how 
improvements  can  be  achieved  by  using  algorithms  that,  are 
somewhat  more  complicated.  Thus  the  appendix  serves  to 
illustrate  that  straightforward  reconfiguration  algorithms 
may  not  always  be  the  best  ones. 


*The  algorithms  in  this  Appendix  were  developed  with  the  help 
of  Tsvi  Dubitzki. 


Algorithm  B . l :  Reconfiguring  a  two-dimensional  array  into  a 
minimum-height  binary  tree. 

Let  A  be  a  rectangular  array  of  automata  which 
contains  N  nodes  where  N  =  r’S  (r&s)  for  integers  r,s.  D  is 
the  node  at  the  northwest  corner  of  A. 

The  basic  steps  of  the  algorithm  are: 

(1)  Send  a  signal  down  from  D  along  the  leftmost  vertical 
line.  Upon  receipt  of  this  signal,  each  node  below  D  along 
the  vertical  line  sends  a  signal  to  erase  the  series  of  hori¬ 
zontal  arcs  emanating  from  it  in  A.  This  gives  us  an  unbalanced 
binary  tree  with  height  at  most  r  +  s.  We  can  view 

this  tree  as  composed  of  one  horizontal  string  of  length  s 
and  s  vertical  strings  of  length  r  -  1.  (The  distinctions 
between  left,  right  up  and  down  connections  at  each  node  are 
known  in  A. ) 

(2)  D  sends  a  signal  to  order  each  string  to  turn  into 

a  balanced  binary  tree  as  described  in  Section  4.2.  This  takes 
at  most  0(s)  time.  We  now  have  r  +  1  binary  trees:  one  with 
height  O(ilog  sj)  and  s  with  height  O(ilog  (r-l)j).  In  the 
above  process  the  tree  arcs  are  marked. 

(3)  Define  the  tree  with  s  nodes  as  the  "horizontal"  tree 
T  and  the  t  trees  with  (r-1)  nodes  as  "vertical"  trees.  We 
will  hang  the  "vertical"  trees  on  the  leaves  of  the  horizontal 
tree  T.  This  is  done  as  follows: 


D  sends  a  horizontal  triggering  signal  through 
all  the  nodes  of  the  tree  T  in  A.  Upon  arrival  at  a  node  i 
(including  D  itself)  the  signal  causes  node  i  to  check  how 
many  marked  arcs  of  the  tree  are  connected  to  it.  If  that 
number  is  1  or  2  (except  the  root  of  T  which  is  marked  and 
considered  as  a  node  with  3  tree  arcs)  it  means  that  respec¬ 
tively  2  or  1  of  the  "vertical"  trees  can  ba  hung  on  node  i 
in  T.  Then  node  i  sends  (ahead  of  the  triggering  signal)  a 
searching  signal  for  2  or  1  roots  of  "vertical"  trees  either 
through  the  node  below  it  in  A  or  to  the  right,  checking  at 
each  node  whether  the  "vertical"  tree  below  it,  in  A,  is  still 
connected  to  it.  If  it  is  still  connected,  then  it  can  be 
assigned  to  node  i  of  T,  i.e.  node  1  connects  itself  to  the 
roots  of  its  assigned  trees  and  the  arcs  of  A  connecting  these 
"vertical"  trees  to  the  upper  horizontal,  line  oi  A  are  dis¬ 
connected.  All  the  new  connecting  arcs  to  the  roots  of  the 
"vertical"  trees  are  marked  as  tree  arcs.  The  horizontal 
triggering  signal  continues  to  the  right  one  time  unit  after 
the  searching  signal  .  starts,  in  order  to  avoid  too  many  temporary 
connections  at  any  node  of  T.  In  case  the  above  searching  sig¬ 
nal,  starting  at  node  i,  does  not  find  enough  needed  unassigned 
"vertical"  trees  to  its  right,  it  bounces  back  to  the  left  in 
the  upper  horizontal  line  of  A  to  look  for  unassigned  "vertical" 
trees  left  by  the  previous  searching  signals.  This  is  not  done 
when  i  is  the  rightmost  node  in  A's  top  line. 
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(4)  All  the  unmarked  arc*  (of  A)  are  erased  by  a  breadth 
first  search  signal  from  D  sent  down  the  spanning  tree  of  A. 

In  the  following  a  leaf  is  defined  to  be  a  node  which 
does  not  have  two  sons  in  T  and  in  said  to  have  one  or  two 
null  links. 

Claim  2.1.1:  There  are  enough  null  links  at  the  leaves  of  T 
to  hang  all  the  "vertical"  trees  in  A. 

Proof ;  There  are  s  nodes  in  T.  By  induction  the  number  of 
null  links  in  a  binary  tree  with  s  nodes  is  s  +  1.  On  the 
other  hand  there  are  only  s  "vertical"  trees  in  A. 

Corollary;  If  the  rightmost  node  in  A's  top  line  finds  under 
it  one  unassigned  tree  to  be  hung  on  it,  then  it  doesn't  bounce 
a  signal  back  along  A's  top  line  since  Claim  2.1.1  proves  that 
there  is  one  less  "vertical"  tree  in  A  than  needed  to  fill 
all  the  null  links. 

Claim  2,1.2:  The  height  of  the  combined  tree  formed  from  T 
and  the  tree  hanging  from  it  is  at  most  one  unit  more  than  the 
height  of  a  balanced  binary  tree  formed  from  a  string  of 
N  =  s*r  nodes. 

Proof ;  The  height  of  a  balanced  binary  tree  with  N  nodes  is 
h  =  llog2Nj.  The  total  height  of  the  combined  tree  constructed 
by  Algorithm  2.1  will  be 

H  -  1  +  Uog?sJ  +  llog2(r-l)J  *  1+  llog?s j+  llog^r J  *  l+pog^j 
so  that  H  *  h  +  1  . 
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Claim  2.1.3;  Algorithm  2.1  takes  0(s)  time. 

Proof ;  Step  (1)  of  disconnecting  the  horizontal  lines  in  A 
takes  0(a+r)  time. 


Step  (2)  of  converting  all  the  strings  into  binary  trees 
takes  0(s)  time. 

Step  (3)  of  converting  the  binary  trees  into  one  tree 
takes  0(s)  time. 

Step  (4)  of  erasing  nontree  arcs  takes  0(s+r)  time. 
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Algorithm  B.2:  Reconfiguring  a  complete  binary  tree  into  a 
two-dimensional  array. 

Let  T  be  a  complete  binary  tree  of  automata  with  N  nodes. 
Let  D  be  the  root  of  T.  By  a  complete  tree  we  mean  a  tree 
in  which  all  the  paths  from  the  root  to  the  leaves  are  of  the 
same  length.  In  the  following  a  leaf  node  of  T  is  a  node 
with  two  null  links. 

The  basic  steps  of  the  algorithm  are: 

(1)  Conversion  into  a  tree  of  strings : 

In  parallel  D  sends  two  signals  down  T,  one  at  unit  jpeed 

4 

and  the  other  at  1/3  speed.  The  unit  speed  signal  bounces 
back  from  the  leaves  of  T  and  meets  the  1/3  speed  signal  at  a 
node  in  the  middle  of  each  path  from  D  to  the  leaves  of  T. 

Sach  such  meeting  node  marks  itself  and  turns  the 
subtree  rooted  at  it  into  a  string  as  described  in  Section  4.3,. 
The  unit  speed  signals  continue  up  to  D  and  make  it  convert  the 
binary  tree  rooted  at  it  and  having  as  leaves  the  marked  nodes 
into  a  string  also.  We  thus  obtain  a  horizontal  string  (the 
last  one)  with  two  folded  strings  hanging  from  every  other 
node  of  it,  since  in  converting  a  binary  tree  into 
a  string  as  described  in  Section  4.3,  every  two  leaf  nodes  are 
separated  by  a  nonleaf  node,  and  the  above  twofold  strings 
hang  only  on  leaf  nodes.  D  knows  that  the  process  of  turning 
the  specified  subtrees  into  strings  has  terminated  as  soon  as 
it  receives  (from  its  two  sons  in  T)  the  string  generating 


signals  which  bounced  back  from  T's  leaves.  All  the  arcs 
of  T  not  participating  in  the  above  construction  are  erased 
as  follows:  D  send  breadth  first  erasing  signals  down  in  T. 

The  signals  bounce  back  from  the  leaves  towards  D  and  on 
their  way  back  erase  every  arc  of  T  except  the  first  level  of 
arcs  above  the  leaves  and  above  the  marked  nodes. 

(2)  Formation  of  a  pseudo-array: 

D  orders  every  hanging  point  (in  the  horizontal  string 
of  (1))  of  a  twofold  string  to  order  the  first  node  in  the 
right  part  of  the  twofold  string  hanging  from  it  to  connect 
itself  to  the  node  to  its  right  and  then  disconnect  itself  from 
its  old  hanging  point.  The  rightmost  node  of  the  horizontal 
string  doesn't  have  a  nonleaf  node  to  its  right  and  therefore 
orders  its  right  neighbor  in  the  twofold  string  hanging  from 
it  to  be  a  new  hanging  point  to  its  right  (thus  part  of  the 
horizontal  string)  from  which  hangs  the  rest  of  the  right  part 
of  that  rightmost  twofold  string.  We  now  have  a 
binary  tree  composed  of  a  set  of  strings  hanging  vertically 
from  a  horizontal  string.  This  binary  tree  is  a  "pseudo-array" 
and  we  need  only  generate  the  horizontal  connections  in  it  in 
order  to  get  an  array.  Note  that  the  rightmost  hanging  string 
is  one  node  shorter  than  the  other  hanging  strings. 

(3)  Conversion  into  an  array: 

First  we  define  for  each  node  in  the  pseudo-array  of  step 
(2)  what  its  upward, downward  and  horizontal  connections  are. 
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For  this  purpose  D  sends  a  breadth-first  search  signal  down 
the  pseudo-array.  The  signals  bounce  back  from  the  bottom 
nodes  of  the  vertical  strings  and  go  back  up  in 
the  strings  of  the  pseudo-array.  Each  entrance  to  a  node 
in  this  path  is  a  downward  connection  and  each  exit  an  upward 
connection.  Upon  arriving  at  the  marked  nodes  of  step  (.1) 
the  definitions  of  the  connections  change  to  horizontal  until 
the  signals  reach  D  again.  Each  node  in  the  horizontal  line 
will  not  emit  a  signal  in  the  horizontal  direction  towards  D 
until  it  has  received  a  horizontal  signal.  Thus  upon  receiving 
two  signals  D  will  know  that  this  marking  process  has  termi¬ 
nated.  At  this  stage  D  orders  each  of  its  horizontal  neighbors 
to  connect  itself  temporarily  to  the  node  on  its  downward  con¬ 
nection.  Then  each  of  the  horizontal  neighbors  of  D 
orders  its  vertical  neighbors  and  the  node  below  D  to  connect 
themselves.  The  above  temporary  connections  are  then  discon¬ 
nected.  In  turn  each  horizontal  neighbor  of  D  starts  such  a 
connecting  process  too.  This  process  propagates  in  the  first 
upper  row  of  the  pseudo-array;  at  the  same  time  each  node 
below  that  row,  having  established  a  horizontal  arc,  starts 
such  a  process  in  the  row  below  it,  and  so  on  until  the  net¬ 
work  of  horizontal  arcs  in  D  is  completed. 

Claim  2.2.1:  The  length  of  the  string  formed  from  the  upper 
part  of  T  (the  upper  row  of  the  final  array)  is  0(/N) . 


Proof ;  The  number  of  nodes  in  T  is  N  which  equals  -  1 

in  a  complete  binary  tree  with  height  h.  The  marked  nodes  in 
step  (1)  of  Algorithm  2.2  divide  T  into  an  upper  complete  tree 
with  height  h/2  and  the  rest  of  T.  In  that  upper  part  of  T 
we  have  N'  -  -  l  nodes.  Therefore  N'  is  0(/N). 

Claim  2.2.2:  Each  hanging  point  in  the  pseudo-array  of 
step  (2)  of  Algorithm  2.2  is  the  middle  of  the  twofold  string 
hanging  from  it  and  the  lengths  of  all  the  twofold  strings  in 
the  pseudo-array  are  equal. 

Proof :  A  complete  binary  tree  has  equal  numbers  of  nodes  in 
the  right  subtree  and  left  subtree  of  its  root.  The  subtrees 
forming  the  twofold  strings  in  step  (1)  of  Algorithm  2.2  are 
complete  binary  trees.  The  process  of  converting  a  binary 
tree  into  a  string  produces  a  string  in  which  the  root  of 
the  tree  is  an  internal  point,  all  the  nodes  to  its  right  come 
from  the  right  subtree  of  the  root  and  all  the  nodes  to  its 
left  come  from  its  left  subtree.  Thus  the  root  of  the  tree 
(a  hanging  point)  is  the  middle  of  the  twofold  string.  The 
lengths  of  all  the  twofold  strings  in  the  pseudo-array  are  equal 
since  all  the  marked  nodes  of  step  (1)  are  at  the  same  depth 
below  D  and  hence  all  the  subtrees  below  them  are  of  the  same 
size. 

Corollary:  The  array  formed  in  step  (3)  of  Algorithm  2.2  is 

of  size  0(/N>  x  0(/N).  This  is  due  to  the  fact  that  the  upper 
horizontal  line  of  the  array  contains  0(/N)  nodes  by  Claim  2.2.1 


and  the  lengths  of  all  the  vertical  strings  hanging  from  the 
horizontal  line  of  step  (2)  are  equal  by  Claim  2.2.2. 

Note  that  Algorithm  2.2  is  applicable  with  slight  changes 
to  non-complete  balanced  binary  trees.  In  particular  if  we 
are  dealing  with  height-balanced  binary  trees  with  minimal 
numbers  of  nodes,  then  the  upper  horizontal  line  of  the  array 
holds  less  than  /ft  nodes  since  the  marked  nodes  of  step  (1) 
(closest  to  the  root)  are  now  closer  to  D  than  in  the  case  of 
a  complete  binary  tree  because  of  the  existence  of  short 
paths  going  through  a  node  to  the  leaves  of  T.  Also  the 
difference  in  length  between  the  vertical  hanging  strings 
grows  with  N  since  we  are  dealing  with  subtrees  (generating  the 
twofold  strings)  which  differ  more  and  more  in  their  numbers 
of  nodes  as  the  height  of  T  grows.  These  factors  give  us 
finally  very  incomplete  rectangular  arrays. 

Claim  2.2.3;  Algorithm  2.2  takes  0(/N)  time. 

Proof :  Step  (1)  of  constructing  the  tr>  j  of  string?  takes 

O(log  N)  time.  Step  (2)  of  constructing  the  pseudo¬ 
array  takes  conoLant  time.  Step  (3)  of  forming  the  horizon¬ 
tal  lines  of  the  array  takes  0(/N)  time  since  we  already  have 
a  skeleton  of  an  array  of  size  0(/N)  x  0(/N) . 
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In  llular  computers,  e.g.  as  they  might  be  used  for  parallel 
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the  P's  all  perform  similar  computations  on  different  parts 
of  the  input  data,  and  by  allowing  each  P.  to  receive 
information  from  a  fixed  set  of  the  otheri  (its  "neighbors"), 
where  these  sets  are  all  of  bounded  size.  This  paper 
discusses,  on  an  abstract  level,  the  concept  of  a  recon- 
figurable  cellular  computer,  in  which  each  P.  can  receive 
information  from  a  set  S.  of  the  other  P'  ,  £nd  the  S.'s 
are  all  of  bounded  size^but  they  need  no?  remain  fixid 
throughout  a  computation.  Requiring  the  S^'s  to  have 
bounded  size  implies  that  most  P's  cannot  communicate 
directly;  the  expected  time  required  for  two  arbitrary  P's 
to  communicate  depends  on  the  graph  structure  defined  by  the 
sets  Sj_.  The  question  of  how  to  change  the  S^'s  in  parallel 
during  the  course  of  a  computation  is  also  discussed. 


